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-Abstract- 

Given an undirected graph G = [V,E) with \V\ = n, and a function / : y —s- N, we consider 
the problem of finding a connected /-factor in G. This problem is NP-Complete when f(v) > rf 
for every v in V and a contant e > 0. We design an algorithm to check for the existence of a 
connected /-factor, for the case where f{v) > n/g{n), for all v in V and g{n) is polylogarithmic 
in n. The running time of our algorithm is As a consequence of this algorithm, we 

conclude that the complexity of connected /-factor for the case we consider is unlikely to be 
NP-Complete unless the Exponential Time Hypothesis (ETH) is false. Secondly, under the ETH 
assumption, we show that the problem is also unlikely to be in P for g{n) in ©((logfor 
any constant e > 0. These results show that for each e > 0 and g(n) in ©((logn)^"'"'^), connected 
/-factor problem for f{v) > n/g{n) is in NP-Intermediate unless the ETH is false. Further, for 
any constant c > 0, when g{n) = c, our algorithm for connected /-factor runs in polynomial 
time. Finally we extend our algorithm to compute a minimum weight connected /-factor in edge 
weighted graphs in the same asymptotic time bounds. 
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[T] Introduction 

Let G = (y, i?) be an undirected graph with n vertices and / ; H —>■ N be a function. An 
/-factor of G is a spanning subgraph H such that dniv) = f{v), for each v in V. The 
problem of deciding whether a given graph G has an /-factor is a well studied problem over 
many years and the problem is shown to be polynomial time solvable by 

Tutte [^. When edges have weights, a simple modification to Tutte’s reduction solves the 
minimum weighted /-factor problem. 

A connected /-factor is an /-factor which is connected. For the case when f{v) = 2 for 
all u in y, a connected /-factor is a Hamiltonian cycle 20 and is NP-Complete to decide. 


In fact, Cheah and Cornell showed that the connected /-factor problem is NP-Complete 
where f{v) = d for each u in y and an integer constant d > 1. For f{v) > |"f] for every v 
in y, deciding a connected /-factor is same as deciding whether there exists an /-factor or 
not. This is because in this case, any /-factor turns out to be connected, due to Ore 13 
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and Dirac [^. 

Past Work on Connected Factors. There has been an extensive study on connected 
[a, 6]-factors in the literature over the past twenty years. An [a, 6]-factor is a subgraph H 
of a graph G such a < dniv) < b, for each v in V. There are many results on sufficiency 
conditions for a graph to have a connected [a, 6]-factor. For example, when S{G) > ^ 
the Graph is Hamiltonian, due to Ore 13 and Dirac [^. Also, if the sum of degrees 


of every pair of non-adjacent vertices is at least n — 1, then the graph has a Hamilton 
path, and this is a connected [1, 2]-factor. Similarly, by relating the size of the maximum 
independent set and the vertex connectivity of a graph, there are sufficiency conditions for 


the existence of connected [a, &]-factors. The survey article by Kouider and Vestergaard 19 
and Plummer 


14 present more results on connected /-factors. 


Our Work. To the best of our knowledge, our study is the first of this kind in the area of 
connected factors. We are motivated by this line of study with an aim to classify functions / 
for which the connected /-factor problem is polynomial time solvable and those for which the 
problem is NP-Complete. In particular, our interest is to obtain a dichotomy for connected 
/-factor problem based on /. To conceptualize the nature of /, f{v) is taken to be at 
least n/g{n) for each v in V where g{n) is a function in o{n). In we have shown that 
the problem is NP-Complete when g{n) is for any constant e between 0 and 1. In 

recent work 12 , we showed that the problem is polynomial time solvable if g{n) = 3. 


While connected [a, 6]-factors are studied extensively from the point of view of identifying 
sufficient conditions, our work is on understanding how the computational complexity of 
connected /-factor problem vary with /. We summarize our results as follows: 


1. An algorithm running in time for deciding the existence of connected /-factor 

in a graph G where f{v) > njg{n) for each w in F and g{n) is in C>(polylog(n)). Clearly, 
the algorithm takes polynomial time when g{n) is a constant and quasi-polynomial time 
when g{n) is polylogarithmic in n. It is interesting that connected "-factor problem 
is polynomial time solvable for any constant c, as this refines the class of functions / 
for which the connected /-factor problem is NP-Complete: connected d-factor is NP- 
Complete for each constant d, as shown by Cheah and Cornell in . 

2. A refined characterization of graphs having connected /-factor where f{v) > n/g{n) for 
every v inV and g{n) is in C>(polylog(n)). 


3. An extension of the above mentioned algorithm to solve the minimum weighted connected 
/-factor problem where f{v) > n/g(n) for every v in V and g{n) is in (!l(polylog(n)), 
without increasing the asymptotic running time. 


4. 


Connected /-factor problem for f{v) > njgin) for every vinV and g{n) is in ©((log 
for any constant e > 0, is in N P-Intermediate under the ETH [^. Thus, this infinite class 
of problems parameterized by e is similar in complexity to the LOGCLIQUE 10 problem 
where the goal is to decide whether there exists a clique of size log n in an n-vertex graph. 


As a consequence of this work, we have a better refined understanding of computational 
complexity of the connected /-factor problem based on the nature of /. The main technique 
in this work is a natural way of converting one /-factor to another by exchanging a set of 
edges. This is formalized using the notion of Alternating Circuits that we use extensively 
in this work. We believe that these techniques for enforcing connectedness along with the 
results of Tutte 18 for finding /-factors plays an important role in understanding the nature 
of the connected /-factor problem for different classes of functions /. 
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Preliminaries 

2.1 Definitions and Notations 


We use standard definitions and notations from West 20 . G = {V, E) represents an undir¬ 
ected graph on n vertices, dc{v) denotes the degree of a vertex u in a graph G and N{v) 
denotes the open neighborhood of a vertex v. g{n) is in C>(polylog(n)) and / is a function 
whose domain is the vertex set of G and range is the set {\n/gin)], ... — 1}. Given 

two subgraphs Gi and G2 of a graph G, we use the basic definitions of binary operations 
Gi n G 2 , Gi U G 2 to be subgraphs obtained by the vertex and edge set intersection and union 
operations respectively. We define the symmetric difference Gi A G 2 between two spanning 
subgraphs Gi and G 2 of G to be the spanning subgraph whose edge set is G(Gi) A E{G 2 )- 
Further, the concepts of circuit, decomposition of a graph G, the subgraph of G induced by 
S C V denoted by G[5'] are standard. We use w(e) to represent weight of an edge e in a 
weighted graph and w{G) to denote the sum of weights of edges in G. 

Given a partition Q — {Qi,Q2, • • •, Qr} of the vertex set of G, a graph G/Q is constructed 
as follows: The vertex set of G/Q is Q. Gorresponding to each edge {u,v) in G where u in 
Qi, V in Qj, i yf j, there exists an edge {Qi, Qj) in G/Q. G/Q is a multigraph without loops. 
For a spanning subgraph G' of G, we say G' connects a partition Q if G'/Q is connected. 
A refinement Q! of a partition Q is a partition of V where each part Q' in Q' is a subset 
of some part Q in Q. This concept of partition refinement is from Kaiser [^. Whenever 
we say a spanning tree of G/Q, we refer to a spanning subgraph T of G having |Q|-1 edges 
that connects Q. 


2.2 Colored Graphs and Alternating Circuits 

A colored graph G is one in which each edge is assigned a color from the set {red, blue}. 
In a colored graph G, we use R and B to denote subgraphs of G whose edges are the set 
of red edges {E{R)) and blue edges {E{B)) of G, respectively, and V{R) = V{B) = V{G). 
We use this coloring in our algorithm to distinguish between edge sets of two distinct /- 
factors of the same graph G. A main computation step in our algorithm is to consider the 
symmetric difference between edge sets of two distinct /-factors and perform a sequence 
of edge exchanges preserving the degree of each vertex. The following definition is used 
extensively in our algorithm. 

► Definition 1. A subgraph S' of a colored graph G is an alternating circuit if S is a circuit, 
and there exists an Eulerian tour of S in which every pair of consecutive edges are of different 
colors. 

Glearly, an alternating circuit has an even number of edges and is connected. Further, 
dniv) = dsiv) for each v in S. We define a minimal alternating circuit S to be an alternating 
circuit where each vertex u in S has at most two red edges and two blue edges incident on 
it. 

► Definition 2. A spanning subgraph S of G is defined to be a switch on another spanning 
subgraph iJ of G if we could color edges in S D H with color red and those in S \ H with 
color blue such that each component in S is an alternating circuit. 

► Definition 3. For an S which is a switch on H, we define Switching(i?,S) to be a subgraph 
G' of G obtained by removing all edges in S'n from H and adding all the edges in S\H 
to H. 
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Whenever the operation Switching(_ff,S') is used, S is assumed to be a switch on H. Finally 
the weight of an alternating circuit S, denoted by bF(S'), is w{B) — w{R). This will be 
used along with switching operation. If G' =Switching(77,S'), then it implies that w{G') = 
w{H) + W{S). The weight of a switch S is also similarly defined to be w{S\H) — w{Sr]H). In 
our arguments we reason about an /-factor obtained by switching a sequence of alternating 
circuits, and for this we introduce the following notation. Let 5 be a set of edge disjoint 
alternating circuits each of which is a switch on H. Let S' = Us^sS. Then the operation 
Switching(7J, S) is the /-factor that results from Switching(iJ,S"). 

Unless otherwise mentioned, g{n) is a function in (!I(polylog(n)). We justify why this choice 
of g{n) is crucial for our analysis in Lemma 16 / is a function / : U —t N such that 


f{v) > ln/g(n)], for each u in U where n = \V\. A consequent fact is that, if H is an 
/-factor of G, then the number of components in H is at most g{n) — 1. We use two crucial 
subroutines from the literature- Tutte’s-Reduction(G,/) is a subroutine which outputs an 
/-factor of G(if one exists) using the reduction in |20| example 3.3.12]. Modified-Tutte’s- 
Reduction(G,/) is an extension of Tutte’s-Reduction(G,/), which computes a minimum 
weighted /-factor of the input weighted graph G by reducing it to the problem of finding 
a minimum weighted perfect matching [^[^. We assume that both the above subroutines 
return empty graphs if they fail to an compute /-factor. 


fT] Outline of the Algorithm and a refined Characterization 

The following is a natural characterization of graphs that have a connected /-factor, and it 
is almost a restatement of the definition of a connected /-factor. 

► Theorem 4. Let G be an undirected graph and f be a function / : U —>■ N. G has a 
connected f-factor if and only if for each partition Q of the vertex set V, there exists an 
f-factor H of G that connects Q. 

The forward direction of the proof is the observation that a connected /-factor connects 
any partition of the vertex set. The converse is proved by applying the hypothesis to the 
partition Q = {{t’l}, {^ 2 },.. ■, Theorem sets up the foundation of our algorithm 

outlined below. 

Outline of the search for connected /-factors: Here we set up the template to 
search for a connected /-factor in an input graph G based on Theorem The details 
are in Algorithm in section Our algorithm constructs a maximal sequence of pairs 
{Hq, Qo), {Hi, Qi), ..., {Hk, Qk) satisfying the following properties: 

1. Each Qi,0 < i < k is a partition of the vertex set V, and Qo = {^}- 

2. Each Hi,0 < i < k is an /-factor of G, and Hi connects Qi. 

3. For each 1 < i < k, Each Qi is a refinement of Qi-i satisfying the following: 

a. Each part in Qi is a maximal component in Hi-i[Y] for some Y in Qi-i- 

b. Qi yf Qi_i and hence |Qi| > |Qi-i| for 1 < z < fc. 

The meaning of maximality of the sequence is that the sequence we consider is not a prefix 
of a longer sequence satisfying the 3 conditions listed above. Since Qi is a refinement of 
Qi-i, it follows that k can be at most n. The following is an interesting and useful fact. 

► Fact 1. Let H be an /-factor of G and let Q be a partitioning of the vertex set V. If H/Q 
is connected and H[Q] is connected for each Q in Q, then H is a connected /-factor. 
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If a refinement of Qk satisfies conditions 1 and 3(a), is same as Qk, then it follows from Fact 
that Hk is a connected /-factor of G. On the other hand, if Hk is not a connected /-factor, 
then any refinement of Qk has the property that there is no /-factor of G that connects it. 
Otherwise, if there was some refinement that can be connected by some /-factor of G, there 
would be a violation to the maximality of the sequence. In this case, from Theorem]^ we 
can conclude that G does not have a connected /-factor. 

The algorithm we design directly evolves from Theorem^and it essentially computes a max¬ 
imal sequence {Hi, Qi ),0 < i < k of pairs satisfying the conditions outlined above. The al¬ 
gorithm initializes Qo = ^{G), and Hq to an arbitrary /-factor of G. The computation of the 
next pair in sequence is done by the subroutine Restricted-/-Factor(i?i_i,Qi_i) (described 
later in Section]^ which computes Qi from Qi-i and Further the algorithm performs 

a computation of Hi from and Qi. The procedure Restricted-/-Factor(i?i_i,Qi_i) 

mainly involves three computational subtasks for each 1 < i < fc. 


1 . 


2 . 


Computes Qi from Qi-\ by considering the subgraph Hi-i[Q] and finds components in 
Hi-i\Q] for each Q in Qi-i- These components are all the parts in Qi. 

A set of steps referred to as Partition-Connector(Qi) in the recursive procedure Restricted- 
/-Factor(i?i_i,Qi_i) does the following: takes the partition Qi as input and returns an 
/-factor H such that H/Qi is connected. This is achieved as follows: for each spanning 
tree T of G/Qi(recall definition), the procedure Partition-Connector(Qi) checks for an 
/-factor H[ containing E{T). If there exists one, then it returns H[ and an empty graph 


otherwise. The correctness of procedure Partition-Connector(Qi) uses Lemma 14 


3. 


A sequence of steps referred to as Next-Factor(iLi_i,iL',Qi) in procedure Restricted-/- 
Factor(iJi_i,Qi_i) computes Hi from H{ and Qi where H{ is an /-factor that 

connects Qi. To compute Hi we color edges in Hi-i with color red and those in H[ 
with color blue. Then we consider the symmetric difference S between and iJ'. 

Further we pick a set S of at most \Qi \ — 1 minimal alternating circuits in S such that 
Switching(iLi_i,5) connects Qi and this is the required Hi. The correctness proof of the 


computation of Hi from Hi-i and iJ' follows from Lemmas [13^ and 15 


Using the above three subtasks in Restricted-/-Factor(iLi_i,Qi_i), we ensure that at least 
f{v) — 2{g{n) — 1) edges incident on v in iLi_i is present in Hi for each v in V. We use this 
constructive computation of Hi from Hi-i to place a lower bound the size of the smallest 
part in Qi for every i < k. This gives an upper bound on the number of parts in Qi and this 
consequently bounds the number of recursive calls made to Restricted-/-Factor(7Ji_i,Qi_i), 
as |Qi| > \Qi-i\. These bounds are presented in Lemmas 18 and 19 Based on our algorithm. 


we come up with a refined characterization for graphs having a connected /-factor, which is 
proved in Section 


► Theorem 5. Let G be a graph and f be a function where f{v) > n/g{n) for every v in V 
and g{n) is a function in G(polylog(n)). G has a connected f-factor if and only if for each 
partition Q of size at most g{n), there exists an f-factor that connects Q. Further, Algorithm 
decides the existence of a connected f-factor in an input graph G in time (5(n^®*-"^). 


[~^ A spectrum of / for which Connected /-factor is in 
NP-Intermediate under ETH 


Let e > 0 be a fixed number, and let g{n) is in G((logn)^+*^). We consider the case when 
f{v) > n/g{n) for all v in V. In this section we show that for each such /, the connected 
/-factor problem is in NP-Intermediate under the ETH. 
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Connected n/0(polylog(n))-factor is unlikely to be NP-Complete. Assume that the 
connected /-factor problem is NP-Complete for some / satisfying the condition mentioned 
above. This implies there exists a polynomial time reduction from 3-SAT to the connected 
/-factor problem where the reduction algorithm outputs the graph G and the function / on 
the vertex set, both of which are polynomial in size of the instance of 3-SAT. Further, we 
reiterate, / satisfies the condition outlined above. From this reduction and the guarantee 
on our Algorithm in Theorem for connected /-factor, it follows that we have an algorithm 
which decides 3-SAT that runs in time and this is impossible under the ETH. 

The following lemma states this observation. 

► Lemma 6. The connected f-factor problem where f{v) > n/g{n) for every v in V and 
g{n) in 0((log is not UP-Complete for any e > 0 unless the ETH is false. 

Connected (n/0((logn)^+'^)-factor is unlikely to be in P. Here we assume that f{v) > 
njgin) where g(ri) is in 0((logfor some e > 0. Note that here e > 0 and in claim 
that it is unlikely to be NP-Complete, we had assumed that e > 0. We now present a sub- 
exponential time reduction R from the Hamiltonian cycle problem to the connected /-factor 
problem. The reduction algorithm takes G on N vertices and e > 0 as input and outputs a 
set Q of (|E(G)p) pairs. Each pair in Q is of the form (G', /) where / is a function and G' 
is a graph. The set Q satisfies the following: 

1. For each pair {G', f) in Q, G' is a graph having n vertices and f{v) > n/log^~^'^n for 
every v in G'. 

2. G has a Hamiltonian cycle if and only if there exists a pair (G', /) in Q such that G' has 
a connected /-factor. 

3. For each G' output by R, the number of edges in G' is where N is the number of 
vertices in G. 

The reduction R is as follows- 

1. Compute n = [2^ ^ where N = |E(G)|. 

2. Construct an empty graph G' containing n vertices. 

3. Define a partition Q of vertices in G' where each part contains at least r iogi+^ ri 1 ^ 

vertices and |Q| = — 2. The existence of partition Q is proved in lemma 

4. For each Q in Q, make G'[Q] a clique. For each v € Q, define dc'iv) = \Q\ — 1. 

5. Let A be a set consisting of exactly one vertex from each Q in Q, |A| = Ai — 2. 

6 . Let f{v) = da'{v) -\- 2 for each vertex v in A and f{v) = dc'iv) for each u in G' \ A. 

7. Let Ui be a vertex in G. 

8. For each 4-vertex path uq,ui,U 2 ,u^ in G, do the following: 

a. Let tr be a bijection from V{G) \ {ui,U 2 } to A. 

b. For each edge {u, ?;} in G\ {ui, W 2 }, add edge {a{u),a{v)} to G'.//Make G \ {ui,U 2 } 
isomorphic to G'[A] under a 

c. Fix /(cr(uo)) = da/v) -\- 1 and /(cr(u 3 )) = dc/v) -\- 1. //For a connected f-factor 
H' in G', the graph H'[A] is a spanning path with sigma{u/) and sigma{u/) as end 
points 

d. Output (G',/). 

e. //resetting G' for the next iteration 

f. Remove all the edges in G'[A] from G'. 

g. Fix /(cr(uo)) = da/v) 2 and f{a{u/)) = da/v) 2. 
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Observe that if the number of vertices TV in G is sufficiently large (depending on e), the total 
space required to hold Q output by R is 

► Lemma 7. Let m be an integer and let k < a/to. Then [to/([to/A:] + 1)J > k — 2. 

Proof in Appendix. 

The following lemma proves the correctness of the reduction. 

► Lemma 8. The graph G has a Hamiltonian cyele H if and only if R outputs a pair (G', /) 
such that G' has a connected f-factor H'. 

Proof in Appendix. 

If we have a polynomial time algorithm for the connected /-factor problem for a given 
constant e > 0, then we test for the existence of a connected /-factor of G' for each (G', /) 
in Q. The size of the set Q is 0{N^) where N = |y(G)|. Computation of each G' takes 
2 o(Af) Thus, in time 2°^^\ we check for the existence of a pair {G', f) in Q such that 

G' has a connected /-factor. 

► Lemma 9. Let e > 0 and let f(v) > n/g(n)) for every v in V and g{n) is in G((logn)^“'"'^). 
Then the connected f-factor problem is not in Punless the ETH is false. 

From lemmas and we come up with the following theorem. 

► Theorem 10. Let G be a graph having n vertices and f be a function where f(v) > n/g(n) 
for each v in V. For each e > 0 and each g{n) in G((log the connected f-factor 
problem is in HP-Intermediate unless the ETH is false. 


[~5l Properties of alternating circuits and /-factors 


To start with, we present properties of alternating circuits which we use extensively. Al¬ 
ternating circuits are intricately related to /-factors as they provide a way of moving from 
one /-factor to another. We present the following lemmas from our previous work in 
and the proofs of the lemmas in this section, which are necessary are in the appendix. 


12 


► Lemma 11. Let T be a graph in which each edge is assigned a color from the set {red, blue}. 
Each component in T is an alternating circuit if and only if dniv) = dB(v) for every v in 
T. 


Consider two /-factors Hi and H 2 of a graph G. If color the edges in Hi with color red 
and those in H2 with color blue, then each component in Hi A H2 is an alternating circuit. 
Note that if two alternating circuits Ti and T 2 have a vertex in common, then Ti U T 2 is an 
alternating circuit. 

► Lemma 12. Let H and H' be two f-factors of G. Lf T = H A H' (symmetric difference 
of the edge sets) then T is a switch on both H and H'. 

► Lemma 13. Let H be a subgraph of G and let T be a switch on H. Assign color red 
to edges in T f] H and blue to those in T \ H. Lf T is a minimal alternating circuit and 
G' = Switching(H,T), then \Nh{v) D Nc'{v)\ > d{v) — 2, for each v in V. 

► Lemma 14. Let S C E{G). An f-factor H containing all the edges in S, if one exists, 
can be computed in polynomial time. 




8 


A Classification of Connected /-factor Problems inside NP 


Decomposing an alternating circuit into minimal alternating circuits. In our 

algorithm we repeatedly take an alternating circuit and decompose into a set of minimal 
alternating circuits containing a given set of edges. The function Min-AC-Set([/,S') in 12 


take an alternating circuit U and a set of edges 5 C 17 as input and output a set U of edge 
disjoint minimal alternating circuits each of which is present in U. Further, each edge in 
S is present in some minimal alternating circuit C in U. Min-AC-Set(17,5') identifies an 
alternating circuit C having dR{v) and dsiv) at most 2 and adds it to U only if some edge 
in S is present in C. Further it removes the identified C from U. This step is repeated until 
U is empty. The crucial step in Min-AC-Set(?7,iS') is to find a minimal alternating circuit in 
U. This is presented in the recursive Procedure Find-Min-AC(17)(in Appendix). 


► Lemma 15. The procedure Min-AC-Set(U,S) outputs a set U of edge disjoint minimal 
alternating circuits each of which has at least one edge from S. 
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Algorithm for computing a Connected /-factor 


In this section we complete the algorithm outlined in Section The algorithm takes an 
unweighted graph G and a function / as input and outputs a connected /-factor of G if it 
exists. When the function f(v) > n/g(ri) for each v in V and g(n) is polylogarithmic in 
n, the algorithm runs in time We start with a justification of why g{n) being 

polylogarithmic in n is crucial for our analysis. 


► Lemma 16. Let g be a function on the set of positive integers, 
constant b such that for each positive integer n, g{n) < b{logn)^. 
2{g{n))'^ is at most n for a sufficiently large constant uq. 


Let there be a positive 
Then for each n > no, 


The proof of the above lemma is easy as njTb'^ is asymptotically larger than (log for 
any constant h. Lemma 16 is crucial in the analysis of our algorithm. Algorithm [^processes 
the input graph based on n. If n is smaller than no, it exhaustively checks for a connected 
/-factor. If n is at least no, then by Lemma 16 n > 2(5(n))^ and we use this to bound the 
running time of our algorithm. 

Description of the Algorithm The idea is to start with an arbitrary /-factor LTo of G and 
compute a connected /-factor using the template in Section]^ We use a recursive subroutine 
Restricted-/-Factor(i7i_i,Qi_i) which returns a connected /-factor if it exists or it returns 
an empty graph otherwise. 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 
19 


Input:G{V, E),f 

0utput:G', a connected /-factor of G if exists, and outputs failure otherwise. 

if n < no then 

exhaustively search for a connected /-factor. 

Return the connected /-factor if it is found, else return failure. 

end 

iJo=Tutte ’s-Reduction (G, /). 
if Hq = empty then 
I then exit reporting failure. 

end 

if Hq is connected then 
I Output Hq and exit. 

end 

Qo—{V{G)}. II Initialize the root partition Qo with a single part V{G) 
G'=Restricted-/-Factor (i/o, Qo) • 

if G' is nonempty then 
I Output G' and exit, 
end 

Output failure. 


Algorithm 1: The Algorithm for deciding connected /-factor when g{n) is in 
G(polylog(n)) 


The following lemma plays a critical role in the correctness of our subroutine Restricted-/- 
Factor(i/i_i,Qi_i). 


► Lemma 17. Let G be a graph having a connected f-factor. Let Q be a partition of the 
vertex set V. There exists a spanning tree T of G/Q and an f-factor H of G such that 
E(T) C E{H). Further given T, H can be computed in polynomial time. 


Proof. Let G' be a connected /-factor of G. For any partition Q of the vertex set, G'/Q is 
connected. Consider a spanning tree T of G' jQ. Clearly, there exists at least one /-factor 
H containing E{T) and hence H/Q is connected. Once we have E{T), H can be computed 
in polynomial time using Lemma |14[ ◄ 


We now present the recursive procedure Restricted-/-Factor() which expands the outline in 
Section [H 
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1 Procedure Restricted-/-Factor(i7i_i,Qi_i) 

2 

Qt=empty. 

3 

for each X G Qi-i do 

4 


Qi—QiU{y\y is vertex set of a maximal component in i7[A]}. 

5 

end 

6 

if Qi = Qi-i then 

7 


return i7i_i and exit. H Hi-i is a connected /-factor 

8 

end 

9 

G'=empty. 

10 

H[=empty. 

11 

H BEGIN Partition-Connector(Qi) 

12 

for each spanning tree T ofG/Qi do 

13 


T=T \ Hi_i.ll Ignore edges that are already there in Hi_i 

14 


if an f-factor H[ containing E{T) exists then 

15 


1 exit Loop 2.11 steps 12... 18 

16 


end 

17 

end 

18 

if H[==empty then 

19 


exit and return empty./' There does not exist an /-factor that connects Qi 

20 

end 

21 

H END Partition-Connector (Qi) 

22 

// BEGIN Next-Factor(iJ,_i,i7',Qi) 

23 

5 

=E{H,_i)AE{Hi). 

24 

S=empty.// Set of minimal alternating circuits containing E{T) 

25 

for each component U G S do 

26 


5=5UMin-AC-Set {U,T) 

27 

end 

28 

Hi =Switching(i7i_i,iS). II Hi is an /-factor containing E(T') 

29 

H END Next-Factor(i7i_i,iJ',Qi) 

30 

G' =Restricted-/-Factor {Hi,Qi). 

31 

return G'. 


Algorithm 2: The procedure Restricted-/-Factor() recursively compute a connected 
/-factor of G, if it exists. 


We use the following lemma in arguing the correctness of our algorithm. 

► Lemma 18. If G has at least 2{g{n))'^ vertices, then in each recursive call to Restricted- 
f-Factor(Hi-i,Qi-i), the number of parts in Qi-i is at most g{n). 


Proof. Consider the computation of Hi from Hi-i in the first call to the subroutine with 
parameters Hq and Qq from Algorithm 1. In each iteration of loop 2, the number of edges in 
T is at most g{n) — 2. This is because the number of components in Hq is at most g{n) — 1. 
Consider S computed in step 24. We color edges in Hi-i n S with color red and those in 
H[ n S with color blue. From Lemma 15 minimality of each s in 5 computed in step 27 
and Lemma 13 |A//._j(v) D A//. (u)| is at least n/g{n) — 2{g{n) — 2) for each vertex v in V. 
Assume that there exists a recursive call in which the number of parts in Qi-i is more than 
g{n). We prove that this contradicts our premise that G has at least 2{g{n))'^ vertices. Let 
the pairs {Hq, Qo)) {Hi, Qi),. .., {Hk, Qk), - ■ ■ be the sequence of arguments to Restricted- 
/-Factor(i7.i_i,Qi_i). Let the number of parts in Qk be larger than g{n) and for each 
0<*<fc — 1, |Qi|< g{n). As discussed above, k can not be 0. Observe that for each 
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I <i <k, \Qi\ > |Qi-i| and |Qo| = 1- This implies \Qi\ >i + l for every i. Thus the level 
number A: — 1 is at most g(n) — 1 and hence k is at most g(n). Let T be a spanning tree 
in GjQi and let H[ be the /-factor in the i-th recursion that contains T(connects Qi). Let 
S be the symmetric difference E{Hi-i) A E{H^), and let S be the subset of decomposition 
of S into minimal alternating circuits which we use for switching in step 29. For each 
0 < i < k — 1, Qi has at most g{'n) parts. The number of parts in Qk-i computed in the 
recursive call Restricted-/-Factor(7Jfc_2,Qfe-2) is at most g{n). This implies in each of those 
recursive calls Restricted-/-Factor(7Li,Qi) where 0 < i < k — 2, the number of edges in T 
computed in step 13 is at most g{n) — 1. Consequently, from Lemma 15 the number of 
minimal alternating circuits in S is at most g{n) — 1 for each recursive call with parameters 
{Hi,Qi) for 0 < i < fc — 1. Thus, from Lemma 13 the size of no<i<fe-ilVi/i(u) is at least 
n/g{n) - 2{g{n) - l)(fc - 1). Further, \NHoiv) D > n/g{n) - 2{g{n) - l){k - 1) 

where Hk-i computed at the end of the call Restricted-/-Factor(iLfc_ 2 ,Qfc- 2 )- This means 
for each vertex v in V, at least n/g{n) — 2{g{n) — l)(fc — 1) edges incident on v in iLfc-i 
were also present in Hq. Since k < g{n), we get the size of Nho{v) n Nn^-iiv) to be 
at least n/g{n) — 2{g{n) — l){g{n) — 1). Further, each part in Qk computed in the call 
Restricted-/-Factor(iLfc_i,Qfc_i) has more than n/g{n) — 2{g{n) — l){g{n) — 1) vertices. 
This implies that the total number of vertices counted in the parts of Qk is more than 
{g{n) + 1) • {n/g{n) — 2{g{n) — l){g{n) — 1)). Clearly, the total number of vertices n should 
be larger than ~ 2((/(n))^). By rearranging the terms we get n < 2{g{n))‘^. This 

contradicts our premise that G has at least 2{g{n))‘^ vertices. Therefore, our assumption 
that \Qk\ > g{n) is wrong. Hence the lemma. ◄ 


► Lemma 19. Let n he at least 2{g{n))‘^. The number of times the subroutine Restricted-f - 
Factor(E[i_i,Qi_i) gets invoked recursively is at most g{n) — 1 . 

Proof. This follows immediately from Lemma as the number of parts in Qi is at most 
g{n), and between two consecutive reclusive calls to Restricted-/-Factor(iJi_i,Qi_i), either 
a connected /-factor is found or we have a Qi larger than Qi-i to work with. ◄ 


Rest of the section contains the proof of the results discussed in Section 

Proof of Running time in Theorem If G has an /-factor, then step 7 of Algorithm 
[^computes an arbitrary /-factor Hq. The first call to Restricted-/-Factor(i7i_i,Qi_i) is 
made with arguments Hq and Qq = {R(G)}. In the z-th recursive call: if Qi and Qi-i 
are the same, then a connected /-factor is found, and this check can be done in polynomial 
time, and the correctness is by Fact If there does not exist a /-factor H[ that connects 
Qi, then the algorithm exits at this step. This check is done by loop 2, and since the 
number of parts is upper bounded by g{n), the time taken is at most where m 

is the number of edges in G. This is the time taken to enumerate all spanning trees T of 
GjQi and to check if there is a /-factor that contains the edges of T. Thus Algorithm 1 
completes in time If G has a connected /-factor, then in at most g(n)-recursive 

calls, the Restricted-/-Factor(i7i_i,Qi_i) will succeed due to Theorem]^ Further, by the 
same theorem, if G does not have a connected /-factor, the procedure will terminate by 
identifying a partition Q that cannot be connected by an /-factor of G. Hence the theorem. 

◄ 


Proof of Characterization in Theorem |5l The forward direction of the proof is implied 
by Theorem]^ as a connected /-factor connects any partition, independent of the size of the 
partition. The reverse direction of the proof is from Algorithm [T] For each partition Q 
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of size at most g{n), if there exists an /-factor that connects Q then clearly the algorithm 
computes a connected /-factor with in at most g{n) — 1 recursions. ◄ 

7 I Computing a minimum weighted connected /-factor 

In this section we consider a variant where the input graph G has positive weights on the 
edges, and the objective is to compute a minimum weighted connected /-factor. We consider 
the case where f{v) > n/g{n) for some function g{n) in 0(polylog(n)). We extend Algorithm 
[2 to solve this minimization problem as follows- 

1. In step 7 of Algorithm instead of initializing Hq with an arbitrary /-factor, we use 
Modified-Tutte’s-Reduction(G',/) to initialize Hq with a minimum weighted /-factor of 
G. 

2. Further in loop 2 in Restricted-/-Factor(i7i_i,Qi_i), H[ is the minimum weighted /- 
factor that connects Qi, if there exists one. 

It is clear, from the arguments in Theorem that these modifications still guarantee that 
the output will be a connected /-factor if one exists, and we have to show that the procedure 
computes an /-factor of minimum cost. We refer to the above extension of Restricted-/- 
Factor(i7i_i,Qi_i) as Restricted-Min-/-Factor(i7i_i,Qi_i). 

To understand the behavior of the modification, recall that Hq is a minimum weight /-factor 
of G. Secondly, let be a minimum weighted /-factor that connects Qi identified in the 
i-th recursion. The procedure builds Hi from Hi-i and iJ/ The following lemma is used in 
bounding the cost of Hi in the i-th recursive call to Restricted-Min-/-Factor(i7i_i,Qi_i). 

► Theorem 20. Let H be a minimum weighted f-factor of G and let T C E{G) \ E{H). 
Let H' be a minimum weighted f-factor among all f-factors containing T. Let S = E(H) A 
E{H') and let the edges of SC\H be colored red and the edges of S D H' be colored blue. Let 
S be a partition of E(S) into minimal alternating circuits. The following are true: 

1. For each s in S, if W{s) >0 then s n T yf /. 

2. For any S' C S satisfying T C Uggs'S, Switching(H,T' ) is an f-factor of weight exactly 
equal to w{H'). 


The proof of Theorem 20 which is also in our recent paper 12 , is in the Appendix. The 


following lemma highlights an invariant which plays a critical role in arguing the correctness 
of the subroutine Restricted-Min-/-Factor(). 


► Lemma 21. Consider the sequence of arguments {Hq, Qq), {Hi, Qi), ..., {Hj., Q^) to the 
procedure Restricted-Min-f -Factor(). Let the input graph G has a connected f-factor. The 
cost of Hk is equal to that of H'). in the k-th recursive call to Restricted-Min-f-Factor (). 

Proof. For fc = 1, Theoremj^directly completes the proof. For fc > 1, we prove by induction 
on the recursion level i. We assume the claim to be true for k—1 and we show this to be true 
for k. We use S.i and Si to address the value of variables S and S respectively, computed 
in step 28 of the function Restricted-Min-/-Factor(iJi_i,Qi_i). Assume that there exists a 
minimal alternating circuit s in H'j. A Hk which is a switch on H^ of negative weight. 

We now derive a contradiction to the optimality of Note that s is edge disjoint 

from Us'GSfc because, A Hk is Sk \ Thus s is a switch on 

Switching(iJfc,5fc) of negative weight. Further, Switching(i7fc,5fe) is Hk-i. By induction, 
Hk-i is of weight H'f._^ computed in the previous level of recursion. From the algorithm. 
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Switching(i7fe_i,s) connects Qk- From the fact that Qk is a refinement of Qk-i, it follows 
that Switching(iJfc-i,s) is an /-factor that connects Qk-i and is of weight less than that 
of and this contradicts the optimality of We now assume that there does not 

exist a minimal alternating circuit s of negative weight in Hk A which is a switch on Hk- 
We show that such a switch s of positive weight also cannot occur. Note that such a positive 
weighted switch s is a switch on H'j, of negative cost. As described above, s is edge disjoint 
from Us'GSfc From the algorithm, Switching(i7^,s) connects Qk and is of lower cost 

than H'f,. Thus, We have a contradiction to the optimality of H'f.. Therefore, the cost of Hk 
is equal to that of H'^, in the fc-th recursive call to Restricted-Min-/-Factor(). ◄ 
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Appendix 


Proof of Lemma [71 Let I = +1. Then {l—2)k < m. From k < y/m, {1—2.) > {k—2). 

We have, 

{I — 2)k < m 

{l-2)k-2{l-2) + 2{k-2) < m 

l{k — 2) < m 

Proof of Lemma [sl Consider the vertex ui selected in step 7 of R. Corresponding to each 
Hamiltonian cycle H in G there exists a unique 4-vertex path uo,ui,U 2 ,U 3 in G and an 
associated iteration of step 8 . Note that H \ {ui,U 2 } is a path L of length N — 2 whose 
end vertices are uq and U3 . We have a corresponding spanning path L' of length A — 2 in 
G'[A]. Removing all the edges in G'[A\ \ L' from G'[A\ gives a connected /-factor H' of G'. 
Conversely, given a connected /-factor H', we pick the edge {B~^{u), B~^{v)} for each edge 
{u,z;} in H'\A\. This gives a path L of length N — 2 whose end vertices are some uq and 
U3. From the reduction algorithm, there exists another path uo,ui,U2,U3 where ui and U2 
are the vertices that are not present in V{L) and we have a Hamiltonian cycle in G. ◄ 


Proof of Lemma 1111 In the forward direction, consider a component G in T. Since there 
is an Eulerian tour in G in which consecutive edges are of different colors, it follows that 
dniv) = dsiv) for all v in T. In the reverse direction, let d]i{v) = dsiv) for all v in T. 
To complete the proof, we point to an exercise in 20 exercise 1.2.35] which considers the 


formulation of Tucker’s algorithm for computing an Eulerian circuit in [16] . First, at each 
vertex n in a component G in T, we pair each red edge incident on to a distinct blue edge 
incident on v. Since dji{v) = dsiv), such a pairing is guaranteed to exist. Secondly, using 
this pairing, the required alternating circuit is the Eulerian circuit constructed by Tucker’s 
algorithm. Hence the lemma. ◄ 


Proof of Lemma 1131 Since T is minimal, by definition, we know that dii{v) = dsiv) for 
each V in V. Consequently, not more than 2 edges incident on a vertex will be removed from 
H as a result of applying Switching(iL,T). Therefore, the number of common edges incident 
on V in both G' and H is at least d{v) —2. ◄ 

Proof of Lemma I14L Observe that removing the set of edges S from an /-factor H con¬ 
taining S, reduces the degree of each vertex w in iL by € S'}]. This is exactly an 

/'-factor of G{V,E \ S) where f'{v) = f{v) — |{(r','u) G S'}], for each v in V. Computing /' 
and then computing an /'-factor H' of G{V,E \ S) is easy. Recall that in polynomial time 
we can compute an /'-factor, if one exists, see West [^. Further adding the edges in S to 
H' gives an /-factor H of G containing S. ◄ 
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Proof of Lemma 1171 Let G' be a connected /-factor of G. For any partition Q of the 
vertex set, from Theorem]^ G'/Q is connected. Consider a spanning tree T of G'jQ. 
Clearly, there exists at least one /-factor H containing E{T) and hence H/Q is connected. 


Once we have E(T), H can be computed in polynomial time using Lemma 14 


Proof of Theorem 1201 For any minimal alternating circuit t which is a switch on H, recall 
that W{t), the weight of t, is r(;(Switching(7J,t)) — w{H). Since H is optimum, for each t 
in T, W{t) = w{Switching{H,t)) — w{H) > 0. Suppose there exists a minimal alternating 
circuit f in T such that W{t) > 0, and t does not contain any of the edges in S. Let us 
consider T' = T \ t, that is T' is an alternating circuit obtained by removing the edges of t 
from T, then W{T') = W{T) — W{t). Then Switching(iJ,T') is an /-factor containing S, and 
w(Switching(£r,r')) = w{H) + W{T') = w{H) + W{T) - W{t) = w{H') - W{t) < w{H'). 
This contradicts the optimality of H'. Therefore, t D S (j). This implies for any subset 
T' C T such that 5 C (J E{t), i(;(Switching(iL,T')) = w{H'). ◄ 

iGT' 
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10 

11 

12 

13 

14 

15 

16 

17 

18 
19 


if dii{u) = dsiu) < 2 for each u in U. then 
I Exit and return U. // U is a. minimal alternating circuit 

end 

For each u in U, pair each blue edge incident on it to a distinct red edge incident 
on It. 

exercise 1.2.35] on U using the pairing defined in the 


20 


Run Tucker’s algorithm 

previous step to get an Euler tour T in which consecutive edges are of different 
colors. 

Let u be a vertex with dfi{v) > 2 in V.// Such a v exists in U 
Start the tour T from v and let ei be the edge through which T leaves v for the 
first time and let 62 be the edge through which T makes the first return to v. Let 
63 be the edge through which T continues the tour and let 64 be the edge through 
which 7 ” makes the next return to v 
if color{e 2 ) yf color{ei) then 
U' = i;,ei,... , 62 ,u. 
else if color^Ci) yf color^e^) then 
U' = V,e3, ... , 64 , w. 
else 

17' = i;, 61 ,..., 64 , V.// color{ei) yf color{e 4 ) 

end 
end 
end 

Return Find-Min-AC(17'). 


Algorithm 3: The procedure Find-Min-AC(17) returns a minimal alternating circuit 
in U 










